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Field of the Invention 
20 The present invention relates generally to the field of 

molecular biology. More specifically, the present invention relates 

1 



to molecular cloning and characterization of homologous 28-kDa 
protein genes in Ehrlichia canis, a multigene locus encoding the 28- 
kDa homologous proteins of Ehrlichia canis and uses thereof. 
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Descriptio n of the Related Art 

Canine ehrlichiosis, also known as canine tropical 

O 

~ n pancytopenia, is a tick-borne rickettsial disease of dogs first 

tn 

FU described in Africa in 1935 and the United States in 1963 (Donatien 

m 

! ^10 and Lestoquard, 1935; Ewing, 1963). The disease became better 

5 

g recognized after an epizootic outbreak occurred in United States 

f* 

Ul military dogs during the Vietnam War (Walker et al y 1970) 

The etiologic agent of canine ehrlichiosis is Ehrlichia 
canis, a small, gram-negative, obligate intracellular bacterium which 

15 exhibits tropism for mononuclear phagocytes (Nyindo et al., 1971) 
and is transmitted by the brown dog tick, Rhipicephalus sanguineus 
(Groves et ai 9 1975). The progression of canine ehrlichiosis occurs 
in three phases, acute, subclinical and chronic. The acute phase is 
characterized by fever, anorexia, depression, lymphadenopathy and 

20 mild thrombocytopenia (Troy and Forrester, 1990). Dogs typically 
recover from the acute phase, but become persistently infected 

2 



carriers of the organism without clinical signs of disease for months 
or even years (Harrus et ai t 1998). A chronic phase develops in 
some cases that is characterized by thrombocytopenia, 
hyperglobulinemia, anorexia, emaciation, and hemorrhage, 
5 particularly epistaxis, followed by death (Troy and Forrester, 1990). 

Regulation of surface antigenicity may be an important 
mechanism for the establishment of such persistent infections in the 

i3 host. Although disease pathogenesis is poorly understood, 

Q 

. multigene families described in members of the related genera 

HI 

njlO Ehrlichia, Anaplasma, and Cowdria may be involved in variation of 

s major surface antigen expression thereby evading immune 

Q .......... 

jjj surveillance. Anaplasma marginale, an organism closely related to £ 

q canis, exhibits variation of major surface protein 3 (msp-3) genes 

m 

> -3- 

resulting in antigenic polymorphism among strains (Alleman et aL, 
15 1997). 

Molecular taxonomic analysis based on the 16S rRNA 
gene has determined that K canis and E. chaffeensis, the etiologic 
agent of human monocytic ehrlichiosis (HME), are closely related 
(Anderson et ai, 1991; Anderson et al, 1992; Dawson et al. t 1991; 
20 Chen et al., 1994). Considerable cross reactivity of the 64, 47, 40, 
30, 29 and 23-kDa antigens between E. canis and E. chaff eens is has 
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been reported (Chen et aL, 1994; Chen et aL, 1997; Rikihisa et aL, 
1994; Rikihisa et aL, 1992). Analysis of immunoreactive antigens 
with human and canine convalescent phase sera by immunoblot has 
resulted in the identification of numerous immunodominant 
5 proteins of E. canis, including a 30-kDa protein (Chen et aL, 1997). 
In addition, a 30-kDa protein of E. canis has been described as a 
lB major immunodominant antigen recognized early in the immune 

o 

□ response that is antigenically distinct from the 30-kDa protein of E 

s 

RJ chaffeensis (Rikihisa et aL, 1992; Rikihisa et aL, 1994). Other 

m 

fit 

: p 10 immunodominant proteins of £. canis with molecular masses ranging 

5 

p from 20 to 30-kDa have also been identified (Brouqui et aL, 1992; 

W Nyindo et aL, 1991; Chen et aL, 1994; Chen et aL, 1997). 

M* 

O 

«y Homologous 28-32kDa immunodominant proteins 

encoded by multigene families have been reported in related 

15 organisms including, E. chaffeensis and Cowdria ruminantium 
(Sulsona et aL, 1999; Ohashi et al., 1998a; Reddy et aL, 1998). 
Recently, characterization of a 21 member multigene family 
encoding proteins of 23 to 28-kDa has been described in E 
chaffeensis (Yu et aL, 2000). The E. chaffeensis 28-kDa outer 

20 membrane proteins are surface exposed, and contain three major 
hypervariable regions (Ohashi et aL, 1998a). The recombinant E 



chaffeensis P28 appeared to provide protection against homologous 
challenge infection in mice, and antisera produced against the 
recombinant protein cross reacted with a 30-kDa protein of E. canis 
(Ohashi et al., 1998a). Diversity in the p28 gene among E 
chaffeensis isolates has been reported (Yu et al., 1999a), and studies 
using monoclonal antibodies have .further demonstrated diversity in 
the expressed P28 proteins (Yu et al., 1993). Conversely, complete 
conservation of a p28 genes in geographically different isolates of E 
canis has been reported and suggests that E: canis may be conserved 
in North America (McBride et al., 1999, 2000). 

The prior art is deficient in the lack of cloning and 
characterization of new homologous 28-kDa immunoreactive. 
protein genes of Ehrlichia canis and a single multigene locus 
containing the homologous 28-kDa protein genes. Further, The 
prior art is deficient in the lack of recombinant proteins of such 
immunoreactive genes of Ehrlichia canis. The present invention 
fulfills this long-standing need and desire in the art. 
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SUMMARY OF THE INVENTION 



Certain embodiments of the present invention describe 
the molecular cloning, sequencing, characterization, and expression 
5 of homologous mature 28-kDa immunoreactive protein genes of 
Ehrlichia canis (designated p28-l\ -2, -3, -5, -6, -7, -9), and the 
identification of a single locus (10,677-bp) containing nine 28-kDa 
p protein genes of Ehrlichia canis (p28-l to p28-9). Eight of the p28 
genes were located on one DNA strand, and one p28 gene was found 

ru 

m 

py 10 on the complementary strand. The nucleic acid homology among 

lp 

s the nine p28 gene members was 37 to 75%, and the amino acid 

Q 

\\ homology ranged from 28 to 72%. 

p In one embodiment of the present invention, there are 

5 

provided DNA sequences encoding a 30-kDa immunoreactive 
15 protein of Ehrlichia canis. Preferably, the protein has an amino acid 
sequence selected from the group consisting of SEQ ID No. 2,4,6, 
40, 42, 44, 46 and the gene has a nucleic acid sequence selected 
from the group consisting of SEQ ID No. 1, 3, 5, 39, 41, 43, 45 and is 
a member of a polymorphic multiple gene family. Generally, the 
20 protein has an N-terminal signal sequence which may be cleaved 
after post-translational process resulting in the production of a 

6 



mature 28-kDa protein. Furthermore, the genes encoding 28-kDa 
proteins are preferably contained in a single multigene locus, which 
has the size of 10,677 bp and encodes nine homologous 28-kDa 
proteins of Ehrlichia canis. 

In another embodiment of the present invention, there is 
provided an expression vector comprising a gene encoding a 28-kDa 
immunoreactive protein of Ehrlichia canis and capable of expressing 
the gene when the vector is introduced into a cell. 

In still another embodiment of the present invention, 
there is provided a recombinant protein comprising an amino acid 
sequence selected from the group consisting of SEQ ID No. 2,4,6, 
40, 42, 44, and 46. Preferably, the amino acid sequence is encoded 
by a nucleic acid sequence selected from the group consisting of 
SEQ ID No. 1, 3, 5, 39, 41, 43, and 45. Preferably, the recombinant 
protein comprises four variable regions which may be surface 
exposed, hydrophilic and antigenic. The recombinant protein may 
be useful as an antigen. 

In yet another embodiment of the present invention, 
there is provided a method of producing the recombinant protein, 
comprising the steps of obtaining a vector that comprises an 
expression region comprising a sequence encoding the amino acid 



sequence selected from the group consisting of SEQ ID No. 2, 4, 6, 
40, 42, 44, and 46 operatively linked to a promoter; transfecting the 
vector into a cell; and culturing the cell under conditions effective 
for expression of the expression region. 

The invention may also be described in certain 
embodiments as a method of inhibiting Ehrlichia canis infection in a 
subject comprising the steps of: identifying a subject prior to 
exposure or suspected of being exposed to or infected with Ehrlichia 
canis; and administering a composition comprising a 28-kDa antigen 
of Ehrlichia canis in an amount effective to inhibit an Ehrlichia canis 
infection. The inhibition may occur through . any means such as, 
e.g., the stimulation of the subject's humoral or cellular immune 
responses, or by other means such as inhibiting the normal function 
of the 28-kDa antigen, or even competing with the antigen for 
interaction with some agent in the subject's body. 

Other and further aspects, features, and advantages of 
the present invention will be apparent from the following 
description of the presently preferred embodiments of the invention 
given for the purpose of disclosure. 
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BRIEF DESCRIPTION OF THE DRAWINGS 

So that the matter in which the above-recited features, 
advantages and objects of the invention, as well as others which will 
become clear, are attained and can be understood in detail, more 
particular descriptions of the invention briefly summarized above 
may be had by reference to certain embodiments thereof which are 
illustrated in the appended drawings. These drawings form a part of 
the specification. It is to be noted, however, that the appended 
drawings illustrate preferred embodiments of the invention and 
therefore are not to be considered limiting in their scope. 

Figure 1 shows nucleic acid sequence (SEQ ID No. 1) 
and deduced amino acid sequence (SEQ ID No. 2) of p28-7 gene 
including adjacent 5' and 3' non-coding sequences. The ATG start 
codon and TAA termination are shown in bold, and the 23 amino 
acid leader signal sequence is underlined. 

Figure 2 shows SDS-PAGE of expressed 50-kDa 
recombinant p28-7-thioredoxin fusion protein (Lane 1, arrow) and 
16-kDa thioredoxin control (Lane 2, arrow), and corresponding 
immunoblot of recombinant p28-7-thioredoxin fusion protein 



recognized by covalescent-phase E. canis canine antiserum (Lane 3). 
Thiroredoxin control was not detected by E. canis antiserum (not 
shown). 

Figure 3 shows amino acid sequences alignment of p28- 
5 7 protein (ECa28-l, SEQ ID NO. 2), p28-5 protein (ECa28SA2, partial 
sequence, SEQ ID NO. 7), p28-4 protein (ECa28SAl, SEQ ID NO. 8), E 
chaffeensis P28 (SEQ ID NO. 9), K chaffeensis OMP-1 family (SEQ ID 
p NOs: 10-14) and C. ruminantium MAP-1 protein (SEQ ID NO. 15). 

s ■ ■ 

ffi The p28-7 amino acid sequence is presented as the consensus 

ill 

CfJ 1 0 sequence. Amino acids not shown are identical to p28-7 and are 



□ 



s 



represented by a dot. Divergent amino acids are shown with the 
; u corresponding one letter abbreviation. Gaps introduced for 
maximal alignment of the amino acid sequences are denoted with a 
dash. Variable regions are underlined and denoted (VR1, VR2, VR3, 
15 and VR4). The arrows indicate the predicted signal peptidase 
cleavage site for the signal peptide. 

Figure 4 shows phylogenetic relatedness of E. canis p28- 
7 (ECa28-l), p28-5 (ECa28SA2, partial sequence), p28-4 
(ECa28SAl), members of the E. chaffeensis omp-1 multiple gene 
20 family, and C. rumanintium map-1 protein from deduced amino acid 
sequences utilizing unbalanced tree construction. The length of 
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each pair of branches represents the distance between the amino 
acid sequence of the pairs. The scale measures the distance 
between sequences. 

Figure 5 shows Southern blot analysis of E. canis 
5 genomic DNA completely digested with six individual restriction 
enzymes and hybridized with a p28-7 DIG-labeled probe (Lanes 2-7); 
DIG-labeled molecular weight markers (Lanes 1 and 8). 

: _ 

q Figure 6 shows comparison of predicted protein 

5 

CP characteristics of E. canis p28-7 (ECa28-l, Jake strain) and E 

lU 

^10 chaffeensis P28 (Arkansas strain). Surface probability predicts the 

e surface residues by using a window of hexapeptide. A surface 

Q 

^ residue is any residue with a >2.0 nm 2 of water accessible surface 

W 

U area. A hexapeptide with a value higher than 1 was considered as 

ru 

surface region. The antigenic index predicts potential antigenic 
15 determinants. The regions with a value above zero are potential 
antigenic determinants. T-cell motif locates the potential T-cell 
antigenic determinants by using a motif of 5 amino acids with 
residue 1-glycine or polar, residue 2-hydrophobic, residue 3- 
hydrophobic, residue 4-hydrophobic or proline, and residue 5 -polar 
20 or glycine. The scale indicates amino acid positions. 

Figure 7 shows nucleic acid sequences and deduced 

1 1 



amino acid sequences of the K canis 28-kDa protein genes p28-5 
(nucleotide 1-849: SEQ ID No. 3; amino acid sequence: SEQ ID No. 4) 
and p28-6 (nucleotide 1195-2031: SEQ ID No. 5; amino acid 
sequence: SEQ ID No. 6) including intergenic noncoding sequences 
5 (NC2, nucleotide 850-1194: SEQ ID No. 31). The ATG start codon 
and termination condons are shown in bold. 

Figure 8 shows schematic of the E. canis 28-kDa protein 
§ gene locus (5.592-Kb, containing five genes) indicating genomic 

Is is J 

fjjj orientation and intergenic noncoding regions (28NC1-4). The 28- 

5 

TUlO kDa protein genes shown in Locus 1 and 2 (shaded) have been 
p described (McBride et al., 1999; Reddy et al. 9 1998; Ohashi et al., 
1998). The complete sequence of p28-5 and a new 28-kDa protein 

I* 

P gene designated p28-6 was sequenced. The noncoding intergenic 

fU 

regions (28NC2-3) between p28-5, p28-6 and p28- 7 were completed 

15 joining the previously unlinked loci 1 and 2. 

Figure 9 shows phylogenetic relatedness of the E. canis 
28-kDa protein gene p28-4 (ECa28SAl), p28-5 (ECa28SA2), p28-6 
(ECa28SA3), p28-7 (ECa28-l) and p28-8 (ECa28-2) based on amino 
acid sequences utilizing unbalanced tree construction. The length 

20 of each pair of branches represents the distance between amino acid 
pairs. The scale measures the distance between sequences. 
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Figure 10 shows alignment of E. canis 28-kDa protein 
gene intergenic noncoding nucleic acid sequences (SEQ ID Nos. 30- 
33). Nucleic acids not shown, denoted with a dot (.), are identical 
to noncoding region 1 (28NC1). Divergence is shown with the 
corresponding one letter abbreviation. Gaps introduced for 
maximal alignment of the amino acid sequences are denoted with a 
dash (-). Putative transcriptional promoter regions (-10 and -35) 
and ribosomal binding site (RBS) are boxed. 

Figure 11 shows schematic representation of the nine 
fiJlO gene E. canis p 2 8 locus (10,677-bp) indicating genomic orientation 
and intergenic noncoding regions. The p28 genes (/?2<3-l, 2, 3, 9 ) 
(unshaded) were identified in Example 8. Shaded p28 genes have 
£3 been identified previously and designated as follows: p28-4, p30a 

ru . 

(Ohashi et al., 1998b) and ORF1 (Reddy et al., 1998); p28-5 and 
15 p28-6 9 (McBride, et. al., 2000); p28-7, p28 (McBride et al., 1999) 
and p30 (Ohashi et al., 1998b); and p28-8, p30-l (Ohashi et al., 
1998b). 

Figure 12 shows phylogenetic relationships of E. canis 
P28-1 to P28-9 based on the amino acid sequences. The length of 
20 each pair of branches represents the distance between amino acid 



13 



P 
hi 



pairs. The scale measures the percentage of divergence between the 
sequences. 

Figure 13 shows nucleic acid sequence (SEQ ED No. 3 9) 
and deduced amino acid sequence (SEQ ID No. 40) of E. canis p28-l 
gene. 

Figure 14 shows nucleic acid sequence (SEQ ID No. 41) 
and deduced amino acid sequence (SEQ ID No. 42) of £. canis p28-2 
gene. 

Figure 15 shows nucleic acid sequence (SEQ ID No. 43) 
and deduced amino acid sequence (SEQ ID No. 44) of E. canis p28-3 
gene. 

Figure 16 shows nucleic acid sequence (SEQ ID No. 45) 
and deduced amino acid sequence (SEQ ID No. 46) of E. canis p28-9 
gene. 

DETAILED DESCRIPTION OF THE INVENTION 

The present invention describes cloning, sequencing and 
expression of homologous genes encoding a 30-kilodalton (kDa) 
protein of Ehrlichia canis. A comparative molecular analysis of 

14 



homologous genes among seven E. cams isolates and the E 

chaff eensis omp-1 multigene family was also performed. Several 
new 28-kDa protein genes are identified as follows: 

p28-7 (ECa28-l) has an 834-bp open reading frame 

encoding a protein of 278 amino acids (SEQ ID No. 2) with a 

predicted molecular mass of 30.5-kDa. An N-terminal signal 
sequence was identified suggesting that the protein is post- 

J3 translationally modified to a mature protein of 27.7-kDa. 

m 

cn 

pit P28-6 (ECa28SA3) has an 840-bp open reading frame 

m 

fUlO encoding a 280 amino acid protein (SEQ ID No. 6). 
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Using PCR to amplify 28-kDa protein genes of E.canis, a 
previously unsequenced region of p28-5 (Eca28SA2) was completed. 
3 Sequence analysis of p28-5 revealed an 849-bp open reading frame 
encoding a 283 amino acid protein (SEQ ID No. 4). 
15 PCR amplification using primers specific for 28-kDa 

protein gene intergenic noncoding regions led to the sequencing of 
regions linkeding two previously separate loci, thereby identifying a 
single locus (5.592-kb) containing five 28-kDa protein genes (p28- 
4, -5, ~6 y -7 and -8). The five 28-kDa proteins were predicted to 
20 have signal peptides resulting in mature proteins, and had amino 
acid homology ranging from 51 to 72%. Analysis of intergenic 

15 



regions revealed hypothetical promoter regions for each gene, 
suggesting that these genes may be independently and differentially 
expressed. Intergenic noncoding regions (28NC1-4) ranged in size 
from 299 to 355-bp, and were 48 to 71% homologous. 

Furthermore, previously unknown regions of DNA 
upstream and downstream of the -above five gene locus of tandemly 
arranged p28 genes were sequenced, and p28-l, -2, -3, and -9 were 
identified. Consequently, a nine gene E. canis p28 locus spanning 
10, 677 bp was identified in the present invention. 

The present invention is directed to, inter alia, 
homologous 28-kDa protein genes in Ehrlichia canis, p28-l, -2, -3, - 
6 y -7, and p28-9, and a complete sequence of previously partially 
sequenced p28-5. Also disclosed is a multigene locus encoding nine 
homologous 28-kDa outer membrane proteins of Ehrlichia canis. 
Eight of the p28 genes were located on one DNA strand, and one p2 8 
gene was found on the complementary strand. The nucleic acid 
homology among the nine p28 gene members was 37 to 75%, and 
the amino acid homology ranged from 28 to 72%. 

In accordance with the present invention there may be 
employed conventional molecular biology, microbiology, and 
recombinant DNA techniques within the skill of the art. Such 

16 



techniques are explained fully in the literature. See, e.g., Maniatis, 
Fritsch & Sambrook, "Molecular Cloning: A Laboratory Manual 
(1982); "DNA Cloning: A Practical Approach," Volumes land II (D.N. 
Glover ed. 1985); "Oligonucleotide Synthesis" (M.J. Gait ed. 1984); 
"Nucleic Acid Hybridization" [B.D. Hames & SJ. Higgins eds. 
(1985)]; "Transcription and Translation" [B.D. Hames & SJ. Higgins 
eds. (1984)]; "Animal Cell Culture" [R.I. Freshney, ed. (1986)]; 
"Immobilized Cells And Enzymes" [IRL Press, (1986)]; B. Perbal, "A 



Practical Guide To Molecular Cloning" (1984). 

fii 

CH 10 The invention includes a substantially pure DNA 

ru " 

C P encoding a 28-kDa immunoreactive protein of Ehrlichia canis. The 

s 

p 

j£ protein encoded by the DNA of this invention may share at least 

W 

M* 80% sequence identity (preferably 85%, more preferably 90%, and 

O 

r ' y most preferably 95%) with the amino acids listed in SEQ ID No. 2, 4, 

15 6, 40, 42, 44 or 46. More preferably, the DNA includes the coding 
sequence of the nucleotides of SEQ ID No. 1, 3, 5, 39, 41, 43, 45, or 
a degenerate variant of such a sequence. 

It is well known in the art that the amino acid sequence 
of a protein is determined by the nucleotide sequence of the DNA 
20 that encodes the protein. Because of the degeneracy of the genetic 
code (i.e., for most amino acids, more than one nucleotide triplet 
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(codon) codes for a single amino acid), different nucleotide 
sequences can code for a particular amino acid, or polypeptide. 
Thus, the polynucleotide sequences of the subject invention also 
encompass those degenerate sequences that encode the 
5 polypeptides of the subject invention, or a fragment or variant 
thereof. * 

This invention also includes a substantially pure DNA 

U 

O containing a sequence of at least 15 consecutive nucleotides 

ffl 

}\* (preferably 20, more preferably 30, even more preferably 50, and 

m 

ru 1 0 most preferably all) of the region from the nucleotides listed in SEQ 
ID No 1, 3, 5, 39, 41, 43, or 45. 

P 

y By "substantially pure DNA" is meant DNA that is not 

p part of a milieu in which the DNA naturally occurs, by virtue of 
separation (partial or total purification) of some or all of the 
15 molecules of that milieu, or by virtue of alteration of sequences that 
flank the claimed DNA. The term therefore includes, for example, a 
recombinant DNA which is incorporated into a vector, into an 
autonomously replicating plasmid or virus, or into the genomic DNA 
of a prokaryote or eukaryote; or which exists as a separate molecule 
20 (e.g., a cDNA or a genomic or cDNA fragment produced by 
polymerase chain reaction (PCR) or restriction endonuclease 
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digestion) independent of other sequences. It also includes a 
recombinant DNA which is part of a hybrid gene encoding an 
additional polypeptide sequence, e.g., a fusion protein. Also 
included in the present invention is a recombinant DNA which 
5 includes a portion of the nucleotides listed in SEQ ED No 1, 3, 5, 39, 
41, 43, or 45 which encodes a 28-kDa immunoreactive protein of 
Ehrlichia canis. 

p The DNA should have at least about 70% sequence 

CP 

ff\ identity to the coding sequence of the nucleotides listed in SEQ ID 

ru 

?! 10 No 1, 3, 5, 39, 41, 43, or 45, preferably at least 75% (e.g. at least 

iU 
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~ 80%); and most preferably at least 90% identity. The identity 

Q 

M* between two sequences is a direct function of the number of 

W 



matching or identical positions. When a subunit position in both of 
the two sequences is occupied by the same monomeric subunit, e.g., 

15 if a given position is occupied by an adenine in each of two DNA 
molecules, then they are identical at that position. For example, if 7 
positions in a sequence 10 nucleotides in length are identical to the 
corresponding positions in a second 10-nucleotide sequence, then 
the two sequences have 70% sequence identity. The length of 

20 comparison sequences will generally be at least 50 nucleotides, 
preferably at least 60 nucleotides, more preferably at least 7 5 
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nucleotides, and most preferably 100 nucleotides. Sequence 
identity is typically measured using, sequence analysis software (e.g., 
Sequence Analysis Software Package of the Genetics Computer 
Group, University of Wisconsin Biotechnology Center, 1710 
5 University Avenue, Madison, WI 53705). 

The present . invention also comprises a vector 
comprising a DNA sequence coding for a which encodes a gene 

M> 

U encoding a 28-kDa immunoreactive protein of Ehrlichia canis and 

Q 

J| said vector is capable of replication in a host which comprises, in 

m 

ftflO. operable linkage: a) an origin of replication; b) a promoter; and c) a 

* DNA sequence coding for said protein. Preferably, the vector of the 

j s j present invention contains a portion of the DNA sequence shown in 

Q SEQ ID No 1, 3, 5, 39, 41, 43, or 45. 

ru 

A "vector" may be defined as a replicable nucleic acid 
15 construct, e.g., a plasmid or viral nucleic acid. Vectors may be used 
to amplify and/or express nucleic acid encoding a 28-kDa 
immunoreactive protein of Ehrlichia canis. An expression vector is a 
replicable construct in which a nucleic acid sequence encoding a 
polypeptide is operably linked to suitable control sequences capable 
20 of effecting expression of the polypeptide in a cell. The need for 
such control sequences will vary depending upon the cell selected 

20 
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and the transformation method chosen. . Generally, control 
sequences include a transcriptional promoter and/or enhancer, 
suitable mRNA ribosomal binding sites, and sequences which control 
the termination of transcription and translation. Methods which are 
well known to those skilled in the art can be used to construct 
expression vectors containing * appropriate transcriptional and 
translational control signals. See for example, the techniques 
described in Sambrook et al., 1989, Molecular Cloning: A Laboratory 
fij Manual (2nd Ed.), Cold Spring Harbor Press, N.Y. A gene and its 

m 

rU 10 transcription control sequences are defined as being "operably 
p linked" if the transcription control sequences effectively control the 

2 

UJ transcription of the gene. Vectors of the invention include, but are 

O not limited to, plasmid vectors and viral vectors. Preferred viral 

ru 

vectors of the invention are those derived from retroviruses, 
15 adenovirus, adeno-associated virus, SV40 virus, or herpes viruses. 

In general, expression vectors containing promoter 
sequences which facilitate the efficient transcription of the inserted 
DNA fragment are used in connection with the host. As used herein, 
the term "host" is meant to include not only prokaryotes but also 
20 eukaryotes such as yeast, plant and animal cells. A recombinant 
DNA molecule or gene which encodes a 28-kDa immunoreactive 

2 1 



protein of Ehrlichia canis of the present invention can be used to 
transform a host using any of the techniques commonly known to 
those of ordinary skill in the art. Especially preferred is the use of a 
vector containing coding sequences for a gene encoding a 28-kDa 
5 immunoreactive protein of Ehrlichia canis of the present invention 
for purposes of prokaryote transformation. 

Prokaryotic hosts^ may include E: coli, S. tymphimurium, 

g Serratia marcescens and Bacillus subtilis. Eukaryotic hosts include 

P 

Eft yeasts such as Pichia pastoris, mammalian cells and insect cells. The 

ru 

i[j 10 transformed hosts can be fermented and cultured according to 

\ m means known in the art to achieve optimal cell growth. 

CP 

M* As used herein, the term "engineered" or "recombinant" 

Ul 

fn cell is intended to refer to a cell into which a recombinant gene, 

ru 

such as a gene encoding an Ehrlichia canis antigen has been 
15 introduced. Therefore, engineered cells are distinguishable from 
naturally occurring cells which do not contain a recombinantly 
introduced gene. Engineered cells are thus cells having a gene o r 
genes introduced through the hand of man. Recombinantly 
introduced genes will either be in the form of a cDNA gene, a copy 
20 of a genomic gene, or will include genes positioned adjacent to a 
promoter not naturally associated with the particular introduced 
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gene. In addition, the recombinant gene may be integrated into the 
host genome, or it may be contained in a vector, or in a bacterial 
genome transfected into the host cell. 

The present invention is also drawn to substantially pure 
5 28-30 kDa immunoreactive proteins of E. canis comprise of amino 
acid sequences listed in, for example, SEQ ID No. 2, 4, 6, 40, 42, 44, 
or 46. 

*jj By a "substantially pure protein" is meant a protein 

5 

C?i which has been separated from at least some of those components 

ru 

CPlO which naturally accompany it. Typically, the protein is substantially 

; y 

y pure when it is at least 60%, by weight, free from the proteins and 

0 ...... 

l& other naturally-occurring organic molecules with which it is 

W 

naturally associated in vivo. Preferably, the purity of the 

ru 

preparation is at least 75%, more preferably at least 90%, and most 
15 preferably at least 99%, by weight. A substantially pure 28-kDa 
immunoreactive protein of Ehrlichia canis may be obtained, for 
example, by extraction from a natural source; by expression of a 
recombinant nucleic acid encoding a 28-kDa immunoreactive 
protein of Ehrlichia canis\ or by chemically synthesizing the protein. 
20 Purity can be measured by any appropriate method, e.g., column 
chromatography such as immunoaffinity chromatography using an 
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antibody specific for a 28-kDa immunoreactive protein of Ehrlichia 
canis, polyacrylamide gel electrophoresis, or HPLC analysis. A 
protein is substantially free of naturally associated components 
when it is separated from at least some of those contaminants which 
5 accompany it in its natural state. Thus, a protein which is 
chemically synthesized or produced in a cellular system different 
from the cell from which it naturally originates will be, by 

I* definition, substantially free from its naturally associated 

3 

rji components. Accordingly, substantially pure proteins include 

fy 

CP 10 eukaryotic proteins synthesized in E. coli, other prokaryotes, or any 

ru 

*P other organism in which they do not naturally occur. 

Q 

u In addition to substantially full-length proteins, the 

hi 

M> invention also includes fragments (e.g., antigenic fragments) of the 

Q " 

n\ 

ih * 28-kDa immunoreactive protein of Ehrlichia canis (SEQ ID No. 2, 4, 
15 6, 40, 42, 44, or 46). As used herein, "fragment," as applied to a 
polypeptide, will ordinarily be at least 10 residues, more typically at 
least 20 residues, and preferably at least 30 (e.g., 50) residues in 
length, but less than the entire, intact sequence. Fragments of the 
28-kDa immunoreactive protein of Ehrlichia canis can be generated 
20 by methods known to those skilled in the art, e.g., by enzymatic 
digestion of naturally occurring or recombinant 28-kDa 
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immunoreactive protein of Ehrlichia canis, by recombinant DNA 
techniques using an expression vector that encodes a defined 
fragment of 28-kDa immunoreactive protein of Ehrlichia canis, or by 
chemical synthesis. The ability of a candidate fragment to exhibit a 
characteristic of 28-kDa immunoreactive protein of Ehrlichia canis 
(e.g., binding to an antibody specific for 28-kDa immunoreactive 
protein of Ehrlichia canis) can be assessed by methods described 
herein. 

Purified 28-kDa immunoreactive protein of Ehrlichia 
canis or antigenic fragments of 28-kDa immunoreactive protein of 
Ehrlichia canis can be used to generate new antibodies or to test 
existing antibodies (e.g., as positive controls in a diagnostic assay) 
by employing standard protocols known to those skilled in the art. 

As is well known in the art, a given polypeptide may vary 
in its immunogenicity. It is often necessary therefore to couple the 
immunogen (e.g., a polypeptide of the present invention) with a 
carrier. Exemplary and preferred carriers are keyhole limpet 
hemocyanin (KLH) and human serum albumin. Means for 
conjugating a polypeptide to a carrier protein are well known in the 
art and include glutaraldehyde, m-maleimidobenzoyl-N- 
hydroxysuccinimide ester, carbo-diimide and bis-biazotized 



benzidine. It is also understood that the peptide may be conjugated 
to a protein by genetic engineering techniques that are well known 
in the art. 

As is also well known in the art, immunogenicity to a 
5 particular immunogen can be enhanced by the use of non-specific 
stimulators of the immune response known as adjuvants. Exemplary 
and preferred adjuvants include complete BCG, Detox, (RIBI, 
Immunochem Research: Inc.) ISCOMS and aluminum hydroxide 

Q 

jp adjuvant (Superphos, Biosector). 

ru 

Cfl 10 Included in this invention are polyclonal antisera 



generated by using 28-kDa immunoreactive protein of Ehrlichia 

canis or a fragment of 28-kDa immunoreactive protein of Ehrlichia 

W 

M canis as the immunogen in, e.g., rabbits. Standard protocols for 



ru 



monoclonal and polyclonal antibody production known to those 
15 skilled in this art are employed. The monoclonal antibodies 

generated by this procedure can be screened for the ability to 

identify recombinant Ehrlichia canis cDNA clones, and to distinguish 

them from known cDNA clones. 

The invention encompasses not only an intact 
20 monoclonal antibody, but also an immunologically-active antibody 

fragment, e.g., a Fab or (Fab)2 fragment; an engineered single chain 
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Fv molecule; or a chimeric molecule, e.g., an antibody which 
contains the binding specificity of one antibody, e.g., of murine 
origin, and the remaining portions of another antibody, e.g., of 
human origin. 

5 In one embodiment, the antibody, or fragment thereof, 

may be linked to a toxin or to a detectable label, e.g. a radioactive, 
label, non-radioactive isotopic label, fluorescent label, 

H» chemiluminescent label, paramagnetic label, enzyme label or 

S 

^ colorimetric label. Those of ordinary skill in the art will know of 

ru 

Cfl 10 these and other suitable labels which may be employed in 
c ^ accordance with the present invention. The binding of these labels 

5 

2 to antibodies or fragments thereof can be accomplished using 

W 

H* standard techniques commonly known to those of ordinary skill in 
the art. 

15 It is also contemplated that pharmaceutical compositions 

may be prepared using the novel proteins of the present invention. 
In such a case, the pharmaceutical composition comprises the novel 
active composition(s) of the present invention and a 
pharmaceutically acceptable carrier. A person having ordinary skill 
20 in this art would readily be able to determine, without undue 



ru 
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experimentation, the appropriate dosages and routes of 
administration of the active component of the present invention. 

The phrase "pharmaceutically acceptable" refers to 
molecular entities and compositions that do not produce an allergic 
or similar untoward reaction when administered to a subject. The 
preparation of an aqueous composition that contains a protein as an 
active ingredient is well understood in the art. Typically, such 
compositions are prepared as injectables, either as liquid solutions 
or suspensions; solid forms suitable for solution in, or suspension 
in, liquid prior to injection can also be prepared. The preparation 
can also be emulsified. 

A protein may be formulated into a composition in a 
neutral or salt form. Pharmaceutically acceptable salts, include the 
acid addition salts (formed with the free amino groups of the 
protein) and which are formed with inorganic acids such as, for 
example, hydrochloric or phosphoric acids, or such organic acids as 
acetic, oxalic, tartaric, mandelic, and the like. Salts formed with the 
free carboxyl groups can also be derived from inorganic bases such 
as, for example, sodium, potassium, ammonium, calcium, or ferric 
hydroxides, and such organic bases as isopropylamine, 
trimethylamine, histidine, procaine and the like. 

28 




Upon formulation, solutions will be administered in a 
manner compatible with the dosage formulation and in such amount 
as is therapeutically effective. The formulations are easily 
administered in a variety of dosage forms such as injectable 
5 solutions. 

For parenteral administration in an aqueous solution, for 
example, the solution should be suitably buffered if necessary and 
the liquid diluent first rendered isotonic with sufficient saline o r 

p 

P glucose. These particular aqueous solutions are especially suitable 

m " • 

L^IO for intravenous, intramuscular, subcutaneous and intraperitoneal 

ft I 

J* administration. In this connection, sterile aqueous media which can 



UJ 

ru 



be employed will be known to those of skill in the art in light of the 
present disclosure. For example, one dosage could be dissolved in 1 
mL of isotonic NaCl solution and either added to lOOOmL of 

15 hypodermoclysis fluid or injected at the proposed site of infusion, 
(see for example, "Remington's Pharmaceutical Sciences" 15 th 
Edition, pages 1035-1038 and 1570-1580). Some variation in 
dosage will necessarily occur depending on the condition of the 
subject being treated. The person responsible for administration 

20 will, in any event, determine the appropriate dose for the individual 
subject. 
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In one embodiment of the present invention, there are 
provided DNA sequences encoding a 30-kDa immunoreactive 
protein of Ehrlichia canis. Preferably, the protein has an amino acid 
sequence selected from the group consisting of SEQ ID No. 2, 4, 6 , 
5 40, 42, 44, 46, and the gene has a nucleic acid sequence selected 
from the group consisting of SEQ ID No. 1, 3, 5, 39, 41, 43, 45 and is 
a member of a polymorphic multiple gene family. More preferably, 

^ the protein has an N-terminal signal sequence which is cleaved after 

□ 

fp post-translational process resulting in the production of a mature 

ru 

ffilO 28-kDa protein. Still preferably, the DNAs encoding 28-kDa 

ru 

C P proteins are contained in a single multigene locus, which has the 

o 

size of 10,677 bp and encodes nine homologous 28-kDa proteins of 

W 

h* Ehrlichia canis. 
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In another embodiment of the present invention, there is 
15 provided an expression vector comprising a gene encoding a 28-kDa 
immunoreactive protein of Ehrlichia canis and capable of expressing 
the gene when the vector is introduced into a cell. 

In still another embodiment of the present invention, 
there is provided a recombinant protein comprising an amino acid 
20 sequence selected from the group consisting of SEQ ID No. 2, 4, 6, 
40, 42, 44, 46. Preferably, the amino acid sequence is encoded by a 
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nucleic acid sequence selected from the group consisting of SEQ ID 
No. 1, 3, 5, 39, 41, 43, 45. More preferably, the recombinant 
protein comprises four variable regions which are surface exposed, 
hydrophilic and antigenic. Still preferably, the recombinant protein 
5 is an antigen. 

In yet another embodiment of the present invention, 
there is provided a method of producing the recombinant protein, 
comprising the steps of obtaining a vector that comprises an 
f f l expression region comprising a sequence encoding the amino acid 

fy 

Cfl 1 0 sequence selected from the group consisting of SEQ ID No. 2,4,6, 

ru 

s h 40, 42, 44, 46 operatively linked to a promoter; transfecting the 

£ 

Q 

p t vector into a cell; and culturing the cell under conditions effective 

W 

M* for expression of the expression region. 

M 

rl I 

s * The invention may also be described in certain 

15 embodiments as a method of inhibiting Ehrlichia canis infection in a 
subject comprising the steps of: identifying a subject suspected of 
being exposed to or infected with Ehrlichia canis\ and administering 
a composition comprising a 28-kDa antigen of Ehrlichia canis in an 
amount effective to inhibit an Ehrlichia canis infection. The 
20 inhibition may occur through any means such as, i.e. the stimulation 
of the subject's humoral or cellular immune responses, or by other 

3 1 
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means such as inhibiting the normal function of the 28-kDa antigen, 
or even competing with the antigen for interaction with some agent 
in the subject's body. 

The following examples are given for the purpose of 
illustrating various embodiments of the invention and are not meant 
to limit the present invention in ahy fashion. 



EXAMPLE 1 

Sequencing Unknown 5' and T Regions of the ECa28-l (p2R-7) Gene 
Ehrlichiae and Purification Ehrlichia canis (Florida 

w 

f* strain and isolates Demon, DJ, Jake, and Fuzzy) were provided by 



Dr. Edward Breitschwerdt, (College of Veterinary Medicine, North 
15 Carolina State University, Raleigh, NC). E. canis (Louisiana strain) 
was provided by Dr. Richard E. Corstvet (School of Veterinary 
Medicine, Louisiana State University, Baton Rouge, LA) and E. canis 
(Oklahoma strain) was provided by Dr. Jacqueline Dawson (Centers 
for Disease Control and Prevention, Atlanta, GA). Propagation of 
20 ehrlichiae was performed in DH82 cells with DMEM supplemented 
with 10% bovine calf serum and 2 mM L-glutamine at 37°C. The 
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intracellular growth in DH82 cells was monitored by presence of R 
canis morulae using general cytologic staining methods. Cells were 
harvested when 100% of the cells were infected with ehrlichiae and 
were then pelleted in a centrifuge at 17,000 x g for 20 min. Cell 
5 pellets were disrupted with a Braun-Sonic 2000 sonicator twice at 
40W for 30 sec on ice. Ehrlichiae were purified as described 
previously (Weiss et ai f 1975). The lysate was loaded onto 
^ discontinuous gradients of 42%-36%-30% renografin, and 

5 

fp centrifuged at 80,000 x g for 1 hr. Heavy and light bands containing 

ru 

1 0 ehrlichiae were collected and washed with sucrose-phosphate- 

f glutamate buffer (SPG, 218 mM sucrose, 3.8 mM KH 2 P0 4 , 7.2 mM 

P 

M* K 2 HP0 4 , 4.9 mM glutamate, pH 7.0) and pelleted by centrifugation. 

W 

N* . 
O 

ru 

Nucleic Acid Preparation Ehrlichia canis genomic DNA 
15 was prepared by resuspending the renografin-purified ehrlichiae in 
600 ^il of 10 mM Tris-HCl buffer (pH 7.5) with 1% sodium dodecyl 
sulfate (SDS, w/v) and 100 ng/ml of proteinase K as described 
previously (McBride et aL, 1996). This mixture was incubated for 1 
hr at 56° C, and the nucleic acids were extracted twice with a 
20 mixture of phenol/chloroform/isoamyl alcohol (24:24:1). DNA was 
pelleted by absolute ethanol precipitation, washed once with 70% 
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ethanol, dried and resuspended in lOmM Tris (pH 7.5). Plasmid 
DNA was purified by using High Pure Plasmid Isolation Kit 
(Boehringer Mannheim, Indianapolis, IN), and PCR products were 
purified using a QIAquick PCR Purification Kit (Qiagen, Santa Clarita, 
5 CA). 

Cloning of ECa28-l (p28-7) Gene The full length 

u sequence of p28-7 gene was determined using a Universal 

P 

a GenomeWalker Kit (CLONTECH, Palo Alto, CA) according to the 

m 

j|jl0 protocol supplied by the manufacturer. Genomic E. canis (Jake 

fu 

c p isolate) DNA was digested completely with five restriction enzymes 



P (Oral EcoRV, PvwII, Seal StuI) which produce blunt-ended DNA. An 

M* 

^ adapter (API) supplied in the kit was ligated to each end of E. canis 

m 
US 

ry DNA. The genomic libraries were used as templates to find the 
15 unknown DNA sequence of the p28-7 gene by PCR using a primer 
complementary to a known portion of the p28-7 sequence and a 
primer specific for the adapter API. Primers specific for p28-7 used 
for genome walking were designed from the known DNA sequence 
derived from PCR amplification of p28-7 with primers 793 (SEQ ID 
20 NO. 16) and 1330 (SEQ ID NO. 17). Primers 394 (5'- 
GCATTTCCACAGGATCATAGGTAA^; nucleotides 687-710, SEQ ID 
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NO. 21) and 394C (5 '-TT ACCT ATG ATCCTGT GGAAATGC-3; 
nucleotides 710-687, SEQ ID NO. 22) were used in conjunction with 
supplied primer API to amplify the unknown 5' and 3' regions of the 
p28-7 gene by PGR. A PCR product corresponding to the 5' region of 
5 the p28-7 gene amplified with primers 394C and API (2000-bp) was 
sequenced unidirectionally with primer 793C (5'-GAGTA 
ACCAACAGCTCCTGC-3', SEQ ID No. 23). A PCR product 
y& corresponding to the 3' region of the p28-7 gene amplified with 
P primers 394 and API (580-bp) was sequenced bidirectionally with 

cn 

^10 the same primers. Noncoding regions on the 5 ? and 3' regions 

rii 

K p adjacent to the open reading frame were sequenced, and primers 

0 EC280M-F (S'-TCTACTTTGCACTTCC ACTATTGT-3', SEQ ID NO. 24) 

[jf and EC280M-R (5 ATTCTTTTGCC ACT ATTT TTCTTT-3', SEQ ID NO. 

s 

ry 25) complementary to these regions were designed in order to 
15 amplify the entire p28-7 gene. 

DNA Sequencing DNA was sequenced with an ABI 

Prism 377 DNA Sequencer (Perkin- Elmer Applied Biosystems, Foster 
City, CA). The entire p28-7 genes of seven E. canis isolates (four 
20 from North Carolina, and one each from Oklahoma, Florida, and 
Louisiana) were amplified by PCR with primers EC280M-F (SEQ ID 
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No. 24) and EC280M-R (SEQ ID No. 25) with a thermal cycling 
profile of 95°C for 5 minutes, and 30 cycles of 95°C for 30 seconds, 
62°C for 1 minutes, and 72°C for 2 minutes and a 72°C extension for 
10 minutes. The resulting PGR products were bidirectionally 
5 sequenced with the same primers. 



£ EXAMPLE 2 

a 
□ 

en ' 
ru 

£P 1 0 PCR Amplification, Cloning, Sequencing and Expression of £. canis 

f EC a 2 8-I (p28-7) ' G ene 

b 

Expression Vectors The entire E. canis p2 8 -7 gene was 



Q 



PCR-amplified with primers-EC280M-F and EC280M-R and cloned 
into pCR2.1-TOPO TA cloning vector to obtain the desired set of 

15 restriction enzyme cleavage sites (Invitrogen, Carlsbad, CA). The 
insert was excised from pCR2.1-TOPO with BstX 1 and ligated into 
pcDNA 3.1 eukaryotic expression vector (Invitrogen, Carlsbad, CA) 
designated pcDNA3.1/EC28 for subsequent studies. The 
pcDNA3.1/EC28 plasmid was amplified, and the gene was excised 

20 with a Kpnl-Xbal double digestion and directionally ligated into 
pThioHis prokaryotic expression vector (Invitrogen, Carlsbad, CA). 
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The clone (designated pThioHis/EC28) produced a recombinant 
thioredoxin fusion protein in Escherichia coli BL2L The 
recombinant fusion protein was crudely purified in the insoluble 
phase by centrifugation. The control thioredoxin fusion protein was 
5 purified from soluble cell lysates under native conditions using 
nickel-NTA spin columns (Qiagen, "Santa Clarita, CA). 

Western Blot Analysis Recombinant E. canis p28-7 

U* 
Q 

rp. fusion protein was subjected to SDS-polyacrylamide gel. 

fll 

S 10 electrophoresis (SDS-PAGE) on 4-15% Tris-HCl gradient gels (Bio- 

ru 

'P Rad, Hercules, GA) and transferred to pure nitrocellulose 

8 

C3 

fa (Schleicher & Schuell, Keene, NH) using a semi-dry transfer cell 

w 

(Bio-Rad, Hercules, CA). The membrane was incubated with 



ru 



convalescent phase antisera from an E. can/^-infected dog diluted 
15 1:5000 for 1 hour, washed, and then incubated with an anti-canine 
IgG (H & L) alkaline phosphatase-conjugated affinity-purified 
secondary antibody at 1:1000 for 1 hour (Kirkegaard & Perry 
Laboratories, Gaithersburg, MD). Bound antibody was visualized 
with 5-bromo-4-chloro-3-indolyl phosphate/nitroblue tetrazolium 
20 (BCIP/NBT) substrate (Kirkegaard & Perry Laboratories, 
Gaithersburg, MD). 
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Southern Blot Analysis To determine if multiple 

genes homologous to the p28-7 gene were present in the E. canis 
genome, a genomic Southern blot analysis was performed using a 
5 standard procedure (Sambrook et al. 1989). E. canis genomic DNA 
digested completely with each Of the restriction enzymes Banll, 
EcoRV, Haell, Kpnl and Spel, which do not cut within the p28-7 gene, 
M and Asel which digests p28-7 at nucleotides 34, 43 and 656. The 

9 

probe was produced by PGR amplification with primers EC280M-F 

ru 

if\\0 and EC280M-R and digoxigenin (DIG)-labeled deoxynucleotide 

ru 

*P triphosphates (dNTPs) (Boehringer Mannheim, Indianapolis, IN) and 

£ 

Q 

jT digested with Asel. The digested probe (566-bp) was separated by 

W 

agarose gel electrophoresis, gel-purified and then used for 

a 

rU hybridization. The completely digested genomic E. canis DNA was 
15 electrophoresed and transferred to a nylon membrane (Boehringer 
Mannheim, Indianapolis, IN) and hybridized at 40°C for 16 hr with 
the p28-7 gene DIG-labeled probe in DIG Easy Hyb buffer according 
to the manufacturer's protocol (Boehringer Mannheim, Indianapolis, 
IN). Bound probe was detected with a anti-DIG alkaline 
20 phosphatase-conjugated antibody and a luminescent substrate 
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(Boehringer Mannheim, Indianapolis, IN) and exposed to BioMax 
scientific imaging film (Eastman Kodak, Rochester, NY). 

Sequence Analysis and Comparasion E. chaff eensis 
5 p28 and C. ruminantium map-1 DNA sequences were obtained from 
the National Center of Biotechnology Information (NCBI). 
Nucleotide and deduced amino acid sequences, and protein and 
phylogenetic analyses were performed with LASERGENE software 

5 

CP (DNASTAR, Inc., Madison, WI). Analysis of post-translational 

ry 

EH 1 0 processing was performed by the method of McGeoch and von 

i y 
-C 

g Heijne for signal sequence recognition using the PSORT program 

D 

h& (McGeoch, 1985; von Heijne, 1986) 

W 

Sequence analysis of p28-7 from seven different strains 

fU 

of E. canis was performed with primers designed to amplify the 
15 entire gene. Analysis revealed the sequence of this gene was 
conserved among the isolates from North Carolina (four), Louisiana, 
Florida and Oklahoma. 



Results 

20 Alignment of nucleic acid sequences from E. chaffeensis 

p28 and Cowdria ruminantium map-1 using the Jotun-Hein 
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aligorithm produced a consensus sequence with regions of high 
homology (>90%). These homologous regions (nucleotides 313-332 
and 823-843 of C. ruminantium map-1; 307-326 and 814-834 of E 
chaff eensis p28) were targeted as primer annealing sites for FOR 
5 amplification. PGR amplification of the E. canis p28-7 gene was 
accomplished with primers 793 * (5-GCAGGAGCTGTTGGTTACTC-3') 
(SEQ ID NO. 16) and 1330 (5'-CCTTCCTCCAAGTTCTATGCC-30 (SEQ 

M» ID NO. 17), resulting in a 518-bp PCR product. E. canis DNA was 

O 
f3 

g~ amplified with primers 793 and 1330 with a thermal cycling profile 

ry 

ifl 10 of 95°C for 2 min, and 30 cycles of 95°C for 30 sec, 62° C for 1 min, 

ru 

4* 72°C for 2 min followed by a 72°C extension for 10 min and 4°C 

5 

- Q ............... 

^ hold. The nucleic acid sequence of the E. canis PCR product was 

LM 

I* obtained by sequencing the product directly with primers 793 and 

Q 

ry 1330. 

15 Analysis of the sequence revealed an open reading frame 

encoding a protein of 170 amino acids, and alignment of the 518-bp 
sequence obtained from PCR amplification of E. canis with the DNA 
sequence of E. chaffeensis p28 gene revealed a similarity greater 
than 70%, indicating that the genes were homologous. 

20 Adapter PCR with primers 394 and 793C was performed 

to determine the 5' and 3' segments of the sequence of the entire 
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gene. Primer 394 produced four PCR products (3-kb, 2-kb, 1-kb, 
and 0.8-kb), and the 0.8-bp product was sequenced bidirectionally 
using primers 394 and API. The deduced sequence overlapped with 
the 3' end of the 518-bp product, extending the open reading frame 
5 12-bp to a termination codon. An additional 625-bp of non-coding 
sequence at the 3' end of the p28-7 gene was also sequenced. 

Primer 394C was used to amplify the 5* end of the p28-7 
gene with supplied primer API. Amplification with these primers 

B 

n 

fft resulted in three PCR products (3.3, 3-kb, and 2-kb). The 2-kb 

fll 

[ft 10 fragment was sequenced unidirectionally with primer 793C. The 

ri i 

4* sequence provided the putative start codon of the p28-7 gene and 

3 

o 

il completed the 834-bp open reading frame encoding a protein of 

W 

M s 278 amino acids. An additional 144-bp of readable sequence in the 

Q 

5' noncoding region of the p28-7 gene was generated. Primers 
15 EC280M-F and EC280M-R were designed from complementary non- 
coding regions adjacent to the p28-7 gene. 

The PCR product amplified with these primers was 
sequenced directly with the same primers. The complete DNA 
sequence for the E. canis p28-7 gene (SEQ ID NO. 1) is shown in 
20 Figure 1. The p28-7 PCR fragment amplified with these primers 
contained the entire open reading frame and 17 additional amino 
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acids from the 5* non-coding primer region. The gene was 
directionally subcloned into pThioHis expression vector, and E. coli 
(BL21) were transformed with this construct. The expressed p28-7- 
thioredoxin fusion protein was insoluble. The expressed protein had 
5 an additional 114 amino acids associated with the thioredoxin, 5 
amino acids for the enterokinase* recognition site, and 32 amino 
acids from the multiple cloning site and 5' non-coding primer region 
at the N-terminus. Convalescent-phase antiserum from an E. canis 
infected dog recognized the expressed recombinant fusion protein, 
10 but did not react with the thioredoxin control (Figure 2). 

EXAMPLE 3 

1 5 Sequence Homology of E. canis p28-7 Gene 

The nucleic acid sequence of E. canis p28-7 (834-bp) and 
the E. chaffeensis omp-1 family of genes including signal sequences 
(p28-7, omp-1 A, B, C, D, £, and F) were aligned using the Clustal 
method to examine homology between these genes (alignment not 

20 shown). Nucleic acid homology was equally conserved (68.9%) 
between E. canis p28-7 y E chaffeensis p28 and omp-1 F. Other 
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putative outer membrane protein genes in the E. chaffeensis omp-1 
family, omp-lD (68.2%), omp-lE (66.7%), omp-lC (64.1%), 
Cowdria ruminant ium map-1 (61.8%), E. canis 28-kDa protein 1 
gene (60%) and 28-kDa protein 2 gene (partial) (59.5%) were also 
5 homologous to p28-7. E. chaffeensis omp-lB had the least nucleic 
acid homology (45.1%) with E. cdnis p28-7. 

Alignment of the predicted amino acid sequences of E 
r« canis P28-7 (SEQ ID NO. 2) and E: chaffeensis P28 revealed amino 
y acid substitutions resulting in four variable regions (VR). 
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m 10 Substitutions or deletions in the amino acid sequence and the 

rii 

s P locations of variable regions of E. canis P28-7 and the E. chaffeensis 

E 

£3 

p; OMP-1 family were identified (Figure 3). Amino acid comparison 

iii 

j:^ including the signal peptide, revealed that E canis P28-7 shared the 
W most homology with OMP-1F (68%) of the E. chaffeensis OMP-1 
15 family, followed by E. chaffeensis P28 (65.5%), OMP-1E (65.1%), 
OMP-1D (62.9%), OMP-1C (62.9%), Cowdria ruminantium MAP-1 
(59.4%), E canis 28-kDa protein 1 (55.6%) and 28-kDa protein 2 
(partial) (53.6%), and OMP-1B (43.2%). The phylogenetic 
relationships based on amino acid sequences show that E. canis P28- 
20 7 and C. ruminantium MAP-1, E. chaffeensis OMP-1 proteins, and E 
canis 28-kDa proteins 1 and 2 (partial) are related (Figure 4). 
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EXAMPLE 4 



Predicted Surface Probability and Tmmnnoreactivity of E. ranis P28-7 
5 Analysis of E. canis P28-7 using hydropathy and 

hydrophilicity profiles predicted 'surface-exposed regions on P28-7 
(Figure 6). Eight major surface-exposed regions consisting of 3 to 9 

^ amino acids were identified on E. canis P28-7 and were similar to the 

D 
p 

rh . profile of surface-exposed regions on E chaffeensis P28 (Figure 6). 

ru 

tfl 10 Five of the larger surface-exposed regions on E. canis P28-7 were 

ru 

C P located in the N-terminal region of the protein. Surface-exposed 

£ 

P 

\2 hydrophilic regions were found in all four of the variable regions of 

bJ 

E. canis P28-7. Ten T-cell motifs were predicted in the P28-7 using 

Q 

rU the Rothbard-Taylor aligorithm (Rothbard and Taylor, 1988), and 

15 high antigenicity of the E. canis P28-7 was predicted by the Jameson- 
Wolf antigenicity aligorithm (Figure 6) (Jameson and Wolf, 1988). 
Similarities in antigenicity and T-cell motifs were observed between 
£. canis P28-7 and E. chaffeensis P28. 



20 
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EXAMPLES 



Detection of Homologous Genomic Copies of E. ranis p28-7 Gene 

Genomic Southern blot analysis of E. canis DNA 
completely digested independently with restriction enzymes Barill, 
EcoRV, Haell, Kpril, Spel, which do not have restriction endonuclease 
sites in the p28-7 gene, and Asel, which has internal restriction 
endonuclease sites at nucleotides 34, 43 and 656, revealed the 
presence of at least three homologous p28-7 gene copies (Figure 5). 
Although E. canis p28-7 has internal Ase. / internal restriction sites, 
the DIG-labeled probe used in the hybridization experiment targeted 
a region of the gene within a single DNA fragment generated by the 
Asel digestion of the gene. Digestion with Asel produced 3 bands 
(approximately 566-bp, 850-bp, and 3-kb) that hybridized with the 
p28-7 DNA probe indicating the presence of multiple genes 
homologous to p28-7 in the genome. Digestion with EcoRV and Spel 
produced two bands that hybridized with the p28-7 gene probe. 



EXAMPLE 6 



PCR Amplification of F ranis KCn?RSA2 (/j2*-S) , E£a2&SAl (p28-6) 
Genes and Identification of the Multiple Gene Locus 

In order to specifically amplify possible unknown genes 
downstream of ECa28SA2 (p28-5)\ primer 46f specific for p28-5 (5'- 
ATATACTTCCTACCTAATGTCTCA-3', SEQ ID No. 18), and primer 13 30 
(SEQ ID No. 17) which targets a conserved region on the 3' end of 
p28-7 gene were used for amplification. The amplified product was 
gel purified and cloned into a TA cloning vector (Invitrogen, Santa 
Clarita, CA). The clone was sequenced bidirectionally with primers: 
M13 reverse from the vector, 46f, ECa28SA2 (5'- 
AGTGCAGAGTCTTCGGTTTC-3', SEQ ID No. 19), ECa5.3 (5'- 
GTT ACTTGCGG AGG AC AT-3 ' , SEQ ID No. 20). DNA was amplified with 
a thermal cycling profile of 95°C for 2 min, and 30 cycles of 95°C 
for 30 sec, 48°C for 1 min, 72°C for 1 min followed by a 72°C 
extension for 10 min and 4°C hold. 

A 2-kb PGR product was amplified with these primers 
that contained 2 open reading frames. The first open reading frame 
contained the known region of the p28-5 gene and a previously 
unsequenced 3' portion of the gene. Downstream from p28-5 an 



additional non identical, but homologous 28-kDa protein gene was 
found, and designated ECa28SA3 (p28-6). 

Specific primers designated ECaSA3-2 (5'-CTAGGATTA 
GGTTATAGTATAAGTT-3', SEQ ID No. 26) corresponding to regions 
5 within p28-6 and primer 793C (SEQ ID No. 23) which anneals to a 
region with p28-7 were used to amplify the intergenic region 
between gene p28-6 and p28-7. DNA was amplified with a thermal 

n cycling profile of 95°C for 2 min, and 30 cycles of 95°C for 30 sec, 

□ 

5 50°C for 1 min, 72°C for 1 min followed by. a 72°C extension for 10 

ru 

|f| 10 min and 4°C hold. 

I 

£ An 800-bp PCR product was amplified which contained 

O 

[* the 3' end of p28-6, the intergenic region between p28-6 and p28-7 

h (28NC3) and the 5' end of p28-7, joining the previously separate 

ru 

loci (Figure 8). The 849-bp open reading frame of p28-5 encodes a 
15 283 amino acid protein, and p28-6 has an 840-bp open reading 
frame encoding a 280 amino acid protein. The intergenic noncoding 
region between p28-6 and p28-7 was 345-bp in length (Figures 7 
and 8) 
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EXAMPLE 7 



Nucleic and Amino Acid Homology of fi. canis p28-4, p28-5, p28-6, 
p28-7 and p28-8 proteins 

The nucleic arid amino acid sequences of all five E. canis 
28-kDa protein genes were aligned using the Clustal method to 
examine the homology between these genes. The nucleic acid 
homology ranged from 58 to 75% and a similar amino acid 
homology of ranging from 67 to 72% was observed between the E 
canis 28-kDa protein gene members (Figure 9). 

Transcriptional Promoter Regions The intergenic 

regions between the 28-kDa protein genes were analyzed for 
promoter sequences by comparison with consensus Escherichia coli 
promoter regions and a promoter from E. chaff eensis (Yu et al. 9 
1997; McClure, 1985). Putative promoter sequences including RBS, 
-10 and -35 regions were identified in 4 intergenic sequences 
corresponding to genes p28-5, p28-6, p28-7, and p28-8 (ECa28-2) 
(Figure 10). The upstream noncoding region of p28-4 (ECa28SAl) is 
not known and was not analyzed. 
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N-Terminal Signal Sequence The amino acid 

sequence analysis revealed that entire E.canis p28-7 has a deduced 
molecular mass of 30.5-kDa and the entire p28-6 has a deduced 
molecular mass of 30.7-kDa. Both proteins have a predicted N- 
terminal signal peptide of 23 amino acids 

(MNCKKILITTALMSLMYY APSIS, SEQ ID No. 27), which is similar to that 
predicted for E. chaffeensis P28 (MNYKKILITSALISLISSLPGV SFS, SEQ 
ID NO. 28), and the OMP-1 protein family (Yu et al, 1999a; Ohashi 
et al, 1998b). 

A preferred cleavage site for signal peptidases (SIS; Ser- 
X-Ser) (Oliver, 1985) is found at amino acids 21, 22, and 23 of p28- 
7. An additional putative cleavage site at amino acid position 2 5 
(MNCKKILITTALISLMYSIPSISSFS, SEQ ID NO. 29) identical to the 
predicted cleavage site of E. chaffeensis P28 (SFS) was also present, 
and would result in a mature p28-7 with a predicted molecular mass 
of 27.7-kDa. Signal cleavage site of the previously reported partial 
sequence of p28-5 is predicted at amino acid 30. However, signal 
sequence analysis predicted that p28-4 had an uncleavable signal 
sequence. 



Summary 

Proteins of similar molecular mass have been identified 
and cloned from multiple rickettsial agents including E. canis, E 
chaff eensis, and C. ruminantium (Reddy et aL, 1998; Jongejan et al., 
5 1993; Ohashi et aL, 1998). A single locus in Ehrlichia chaff eensis 
with 6 homologous p28 genes, and* 2 loci in E. canis, each containing 
some homologous 28-kDa protein genes have been previously 

it described. 

£3 

f|l The present invention demonstrated the cloning, 

ru 

CHlO expression and characterization of genes encoding mature 28-kDa 

ru 

jj* proteins of E. canis that are homologous to the omp-1 multiple gene 

U family of E. chaff eensis and the C ruminantium map- 1 gene. Two 

UJ 

I s * new 28-kDa protein genes were identidfied, p28-7 and p28-6. 

□ 
rn 

Another Exanis 28-kDa protein gene, /?25-5, partially sequenced 
15 previously (Reddy et al., 1998), was sequenced completely in the 
present invention. Also disclosed is the identification and 
characterization of a single locus in Exanis containing five Exanis 
28-kDa protein genes {p28-4, p28-5, p28-6 } p28-7 and p28-8). 

The Exanis 28-kDa proteins are homologous to 
20 Exhaffeensis OMP-1 family and the MAP-1 protein of C. 
rumanintium. The most homologous E. canis 28-kDa proteins (p28- 
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6, p28-7 and p28-8) are^ sequentially arranged in the locus. 
Homology of these proteins ranged from 67.5% to 72.3%. 
Divergence among these 28-kDa proteins was 27.3% to 38.6%. E 
canis 28-kDa proteins p28-4 and p28-5 were the least homologous 
with homology ranging from 50.9% to 59.4% and divergence of 5 3.3 
to 69.9%. Differences between the genes lies primarily in the four 
hypervariable regions and suggests that these regions are surface 
exposed and subject to selective pressure by the immune system. 



ffl Conservation of p28-7 among seven E. canis isolates has been 

fU 

10 reported (McBride et al. % 1999), suggesting that E. canis may be 

3 %l 



clonal in North America. Conversely, significant diversity of p2 8 



M> among £. chaff eensis isolates has been reported (Yu et al., 1999a). 

W 

J* All of the E. canis 28-kDa proteins appear to be post 

ru 

translationally processed from a 30-kD protein to a mature 28-kD 
15 protein. Recently, a signal sequence was identified on E. chaff eensis 
P28 (Yu et al.y 1999a), and N-terminal amino acid sequencing has 
verified that the protein is post-translationally processed resulting 
in cleavage of the signal sequence to produce a mature protein 
(Ohashi et al, 1998). The leader sequences of OMP-1F and OMP-1E 
20 have also been proposed as leader signal peptides (Ohashi et al, 
1998). Signal sequences identified on K chaff eensis OMP-1F, OMP- 
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IE and P28 are homologous to the leader sequence of E. canis 28- 
kDa protein. Promoter sequences for the p28 genes have not been 
determined experimentally, but putative promoter regions were 
identified by comparison with consensus sequences of the RBS, -10 
5 and -35 promoter regions of E. coli and other ehrlichiae (Yu et aL, 
1997; McClure, 1985). Such promoter sequences would allow each 
gene to potentially be transcribed and translated, suggesting that 
^ these genes may be differentially expressed in the host. Persistence 

ft 

fy of infection in dogs may be related to differential expression of p2 8 

m 

ru io genes resulting in antigenic changes in vivo, thus allowing the 
jjtj organism to evade the immune response. 

i* 

hi The E. canis 28-kda protein genes were found to exhibit 

Q nucleic acid and amino acid sequence homology with the £ 
chaffeensis omp-1 gene family and C. ruminantium map-1 gene. 
15 Previous studies have identified a: 30-kDa protein of E. canis that 
reacts with convalescent phase antisera against E. chaffeensis, but 
was believed to be antigenically distinct (Rikihisa et aL, 1994). 
Findings based on comparison of amino acid substitutions in four 
variable regions of E. canis 28-kDa proteins support this possibility. 
20 Together these findings also suggest that the amino acids 
responsible for the antigenic differences between E. canis and E 
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chaff eensis P28 are located in these variable regions and are readily 
accessible to the immune system. 

It was reported that immunoreactive peptides were 
located in the variable regions of the 28-kDa proteins of C. 
ruminantium, E. chaff eensis and E. canis (Reddy et aL, 1998). 
Analysis of E. canis and R chaff eensis P28 revealed that all of the 
variable regions have predicted surface-exposed amino acids. A 
study in dogs demonstrated lack of cross protection between E 
canis and R chaffeensis (Dawson and Ewing, 1992). This 
observation may be related to antigenic differences in the variable 
regions of P28 as well as in other immunologically important 
antigens of these ehrlichial species. Another study found that 
convalescent phase human antisera from E. chaffeensis-infected 
patients recognized 29/28-kDa protein(s) of R chaffeensis and also 
reacted with homologous proteins of E. canis (Chen et aL, 1997). 
Homologous and crossreactive epitopes on the E. canis 28-kDa 
protein and E. chaffeensis P28 appear to be recognized by the 
immune system. 

R. canis 28-kDa proteins may be important 
immunoprotective antigens. Several reports have demonstrated that 
the 30-kDa antigen of £. canis exhibits strong immunoreactivity 
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(Rikihisa et a/., 1994; Rikihisa et al, 1992). Antibodies in 
convalescent phase antisera from humans and dogs have 
consistently reacted with proteins in this size range from E 
chaffeensis and E. canis y suggesting that they may be important 
immunoprotective antigens (Rikihisa et aL, 1994; Chen et aL, 1994; 
Chen et aL, 1997). In addition, antibodies to 30, 24 and 21-kDa 
proteins developed early in the immune response to E. canis 
(Rikihisa et ai, 1994; Rikihisa et al. 7 1992), suggesting that these 
proteins may be especially important in the immune responses in 



tn 
ru 

CP 

rij 10 the acute stage of disease. Recently, a family of homologous genes 

■ encoding outer membrane proteins with molecular masses of 28- 

u 

L| kDa have been identified in E. chaffeensis, and mice immunized with 

i* 

Q recombinant £. chaffeensis P28 appeared to have developed 

ru 

immunity against homologous challenge (Ohashi et al, 1998). The 
15 P28 of E. chaffeensis has been demonstrated to be present in the 
outer membrane, and immunoelectron microscopy has localized the 
P28 on the surface on the organism, and thus suggesting that it may 
serve as an adhesin (Ohashi et al, 1998). It is likely that the 28-kDa 
proteins of E. canis identified in this study have the same location 
20 and possibly serve a similar function. 
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Comparison of p28-7 from different strains of £ canis 
revealed that the gene is apparently completely conserved. Studies 
involving £ chaffeensis have demonstrated immunologic and 
molecular evidence of diversity. Patients infected with £ chaffeensis 
have variable immunoreactivity to the 29/28-kDa proteins, 
suggesting that there is antigenic diversity. (Chen et aL, 1997). 
Recently molecular evidence has been generated to support 
antigenic diversity in the p 2 8 gene from £ chaffeensis (Yu et al., 
1999a). A comparison of five £ chaffeensis isolates revealed that 
two isolates (Sapulpa and St. Vincent) were 100% identical, but 
three others (Arkansas, Jax,_ 91HE17) were divergent by as much as 
13.4% at the amino acid level. The conservation of £ canis p28-7 
suggests that £ canis strains found in the United States may be 
genetically identical, and thus £ canis 28-kDa protein is an 
attractive vaccine candidate for canine ehrlichiosis in the United 
States. Further analysis of £ canis isolates outside the United States 
may provide information regarding the origin and evolution of £ 
canis. Conservation of the 28-kDa protein makes it an important 
potential candidate for reliable serodiagnosis of canine ehrlichiosis. 

The role of multiple homologous genes is not known at 
this point; however, persistence of E.canis infections in dogs could 
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conceivably be related to antigenic variation due to variable 
expression of homologous 28-kDa protein genes, thus enabling E 
canis to evade immune surveillance. Variation of msp-3 genes in A. 
marginale is partially responsible for variation in the MSP-3 protein, 
resulting in persistent infections (Alleman et al., 1997). Studies to 
examine 28-kDa protein gene expression by E. canis in acutely and 
chronically infected dogs would provide insight into the role of the 
28-kDa protein gene family in persistence of infection. 



EXAMPLE 8 



Identification of E. canis p28-l, p28-2 f p28-3 and p28-9 Genes 

Unknown regions of DNA upstream and downstream of 

15 the five gene locus of tandemly arranged p28 genes described above 
were sequenced by designing gene specific primers for p28-l 
(ECa28-75C) and p28-5 (ECa28-5-818f) to extend the p28 gene locus 
bidirectionally. Multiple gene walks were performed to obtain the 
unknown sequence as follows: 1.9-kp downstream of the 5 gene 

20 locus was amplified and sequenced using primers p28-5-818f (5'- 
TTA AAC ATA TGC CAC TTC GGA CTA-3', SEQ ID No. 34), producing a 
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900-bp amplicon, and 1191 (5'-TAT GAT CGT GTA AAA TTG CTG 
TGA GTA T-3\ SEQ ED No. 35), producing a 1-kb amplicon. The 
3.67-kbp of DNA upstream of the five gene locus was amplified and 
sequenced with primers ECa28-75C (5'-TAC TGG CAC GTG CTG GAC 
5 TA-3\ SEQ ID No. 36), producing a 1.6-kbp amplicon; ECa5'-1600 
(5'-CAC CAA TAA ATG CAG AGA CTT C-3\ SEQ ID No. 37), producing 
a 1.6-kbp amplicon; and 3125 (5'-AAT CCA TCA TTT CTC ATT ACA 

u GTG TG-3\ SEQ ID No. 38), producing a 800-bp amplicon. The locus 

Q 

^ of nine tandemly arranged genes consisting of the four new p28 

fit 

CH 10 genes, and the five p28 genes described above were designated p28- 

ru ^ . 

"P 1 through p28-9 (Figure 11). 

p ......... 

P. The nucleic acid and amino acid sequences of the R 

W 

M» canis p28 genes were aligned using the Clustal method to examine 

Q 

|5 J the homology between these genes. Homology of these proteins 
15 ranged from 67.5% to 75%, and divergence among these P28 
proteins was 26.9% to 38%. E. canis P28 proteins P28-1, P28-2, and 
P28-9 were the least homologous with the other p28 genes ranging 
from 37% to 49% and divergence of 53 to 77 %. The nucleic acid 
homology of the nine p28 genes ranged from 28 to 72%. The 
20 phylogenetic relationships based on the E. canis p28 amino acid 
sequences are shown in Figure 12. 
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Nucleotide sequence and accession numbers. The 
GenBank accession numbers for the nucleic acid and amino acid 
sequences for the complete nine gene £. canis (Jake strain) p28 gene 
locus is AF082744. This accession number was originally assigned 
to p28-7, but has been updated with the sequence of the nine gene 
p28 locus, which includes p28-7. GenBank accession numbers for 
nucleic acid and amino acid sequences of p28-7 in other E. canis 
isolates described in this study are: Louisiana, AF082745; 



ru 

lf\ 10 Oklahoma, AF082746; Demon, AF082747; DJ, AF082748; Fuzzy, 

ru 

-P AF082749; Florida, AF082750. 

s 

0 

i£ Multiple bands in the 28-kilodalton range have been 

W 

h& observed by immunoblots of convalescent sera from E. canis 

^ infected dogs (Rikihisa et al., 1994), and expression of multiple p2 8 

15 proteins could be an explanation for this observation. Southern blot 
studies suggest that other p28 genes, in addition to the five 
members of this locus, are present in the genome (McBride et al., 
1999; Ohashi et aL, 1998b). 

In this study a single gene locus containing nine 
20 tandemly arranged E. canis p28 genes encoding homologous, but 
nonidentical, p28 genes was identified. The nine gene locus 
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included four new p28 genes (Figures 13-16) and five tandemly 
arranged p28 genes that were reported above. Eight of the p28 
genes were located on one DNA strand, and one p28 gene was found 
on the complementary strand. The nucleic acid homology among 
5 the nine p28 gene members was 37 to 75%, and the amino acid 
homology ranged from 28 to 72%: 

The P28s of E. canis were found to be as closely related 
M> to 28-kilodalton proteins of other species such as E. chaffeensis as 

s 

m they are to themselves (McBride et aL, 2000). Differences among 

ru 

Cfi 10 the proteins are found primarily in several major hypervariable 

ru 

S P regions and suggest that these regions are surface exposed and 

5 

U subject to selective pressure by the immune system (McBride et al., 

W 

y> 2000). 

s 

ry Conservation of an E canis p28 gene (p28-7) among 

15 seven geographically different isolates has been reported (McBride 
et al., 1999), suggesting that E.canis may be highly conserved in 
North America. Similarly, the 120-kDa glycoprotein of E. canis is 
also conserved among isolates in the United States (Yu et al., 1997). 
In contrast, both the 120-kDa and the 28-kDa protein genes of E 
20 chaffeensis are divergent among isolates (Yu et al., 1999a; Chen et 
al., 1997). The diversity, of the 28-kDa protein gene of E 
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chaffeensis appeared to result from point mutations in the 
hypervariable regions perhaps due to selective immune pressure (Yu 
et al., 1999a). These data suggest that £. canis may have been 
introduced into North America relatively recently, and this may 
5 account for the conservation that was observed among isolates. The 
conservation of p28 genes in £. canis isolates may provide an 
opportunity to develop vaccine and serodiagnostic antigens that are 
iA particularly effective for disease prevention and serodiagnosis. A 

b 

Q mixture of the P28s may provide the most reliable serodiagnostic 

in ^ 

[j| 10 test, but serodiagnosis with a single P28 has been reported to be 

ru 

s p useful for immunodiagnosis (Ohashi et al., 1998b; McBride et al., 

□ 1999). 
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Any patents or publications mentioned in this 

specification are indicative of the levels of those skilled in the art to 

which the invention pertains. These patents and publications are 

herein incorporated by reference to the same extent as if each 

individual publication was individually indicated to be incorporated 

by reference. 

One skilled in the art will readily appreciate that the 
present invention is well adapted to carry out the objects and obtain 
the ends and advantages mentioned, as well as those inherent 
therein. The present examples along with the methods, procedures, 
treatments, molecules, and specific compounds described herein are 
presently representative of preferred embodiments, are exemplary, 
and are not intended as limitations on the scope of the invention. 
Changes therein and other uses will occur to those skilled in the art 
which are encompassed within the spirit of the invention as defined 
by the scope of the claims. 
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